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Abstract 


In  this  report,  an  information  theoretic  criterion  based  approach  for  estimating  the 
number  of  emitters  from  a  set  of  interleaved  pulse  trains  is  proposed.  In  the  approach,  a 
new  pulse  signal  model  is  formulated  to  handle  large  numbers  of  pulses.  The  approach 
is  based  on  the  application  of  the  general  information  criterion  (GIC)  and  has  the 
advantage  of  not  requiring  any  threshold  setting  procedures.  Compared  to  the  classical 
information  theoretic  criterion  based  approaches,  the  general  information  theoretic 
criteria  based  approach  is  more  versatile  and  does  not  involve  any  computationally 
sophisticated  maximum  likelihood  estimator.  Computer  simulations  are  used  to 
demonstrate  the  effectiveness  of  the  proposed  approach. 


Resume 


Nous  proposons  dans  ce  rapport  une  nouvelle  approche  pour  estimer  le  nombre 
d’emetteurs  a  partir  d’une  serie  de  trains  d'impulsions  entrelaces.  Nous  formulons  un 
nouveau  modele  de  signaux  capable  de  traiter  un  grand  nombre  d'impulsions. 

L' approche  est  basee  sur  l'application  du  Critere  d’Information  General  et  a  conime 
avantage  de  ne  pas  necessiter  de  procedures  d'ajustement  de  seuils.  Notre  approche  est 
plus  souple  par-  rapport  aux  approches  basees  sur  les  criteres  classiques  de  la  theorie  de 
1’ information,  et  n’implique  aucun  estimateur  de  probabilite  maximum  sophistique. 
L’efficacite  de  cette  approche  est  demontree  par  des  simulations  faites  a  l’ordinateur. 
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Executive  summary 


Background:  In  a  practical  signal  environment,  a  wide-open  ESM  system  with  full 
angular  coverage  will  receive  a  sequence  of  signal  pulses  interleaved.  In  order  to 
identify  the  emitters,  the  pulse  trains  need  to  be  deinterleaved.  Conventional 
approaches  for  deinterleaving  arc  based  on  the  use  of  pulse  parameters  such  as  DOA, 
RF,  TOA,  PW,  PRI  and  polarization.  They  are  efficient  for  simple  emitters  with 
relatively  constant  frequency  and  PRI,  and  well  separated  parameters.  However,  they 
encounter  difficulties  in  dense  signal  environments  where  emitters  have  similar 
parameters.  The  presence  of  similar  emitters  plus  inaccuracy  in  signal  parameter 
measurement  by  the  receivers  will  cause  ambiguities  in  deinterleaving.  Furthermore,  as 
modern  radar  adopts  more  and  more  complex  digital  techniques  and  sophisticated 
waveform  modulations,  conventional  techniques  become  even  more  vulnerable. 

Recently,  there  has  been  increasing  interest  in  developing  deinterleaving  techniques 
based  on  the  use  of  intrapulse  information  of  the  pulses.  Intrapulse  carries  rich 
information  about  the  emitter  characteristics,  signal  propagation  path  and  the 
environment.  The  intrapulse  information  describes  the  pulse  more  completely  and  more 
uniquely  than  the  conventional  pulse  parameters.  In  this  report,  we  focus  on  the 
problem  of  estimating  the  number  of  emitters  from  a  set  of  pulse  trains  based  on  the  use 
of  intrapulse  information,  which  is  a  pre-requisite  for  pulse  deinterleaving. 

Results:  Previously,  the  author  had  developed  a  novel  approach  for  estimating  the 
number  of  emitters  based  on  the  minimum  description  length  (MDF)  criterion. 
However,  the  approach  has  a  drawback  in  handling  large  numbers  of  pulse.  It  would 
involve  heavy  computation  when  the  number  of  pulses  increases.  In  this  report,  we 
extended  the  approach  to  handling  large  numbers  of  pulses.  The  main  points  arc  as 
follows. 

•  The  signal  model  is  re-formulated  for  accepting  large  numbers  of  pulses. 

•  The  general  information  theoretic  (GIC)  criteria  arc  applied  to  estimating  the 
number  of  emitters.  Performace  analysis  was  carried  out  and  it  was  shown  that  the 
approach  is  statistically  consistent. 

•  Computer  simulations  arc  used  to  demonstrate  the  effectiveness  of  the  approach. 

Significance:  The  approach  has  overcome  the  difficulties  encountered  by  the 
previously  developed  method.  It  is  capable  of  handling  large  numbers  of  pulses, 
making  it  attractive  for  real-time  applications.  It  is  shown  that  the  GICs  arc  generalized 
versions  of  the  regular  information  theoretic  criteria  with  the  same  advantages,  i.e.,  they 
do  not  require  any  subjective  threshold  setting.  In  addition,  since  the  GICs  do  not 
involve  any  computationally  sophisticated  maximum  likelihood  estimator,  it  is  efficient 
in  terms  of  computation  complexity  and  time.  In  summary,  the  proposed  approach  is  a 
good  candidate  for  use  in  a  modern  autonomous  ESM  system. 
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Sommaire 


Contexte:  Dans  un  bon  environnement  pour  les  signaux,  un  systeme  ESM  ouvert  a 
couverture  angulaire  complete  recevra  une  serie  d'impulsions.  Afin  d'identifier  les 
emetteurs,  les  trains  d’impulsions  doivent  etre  desentrelaces.  Certaines  approches 
conventionnelles  utilisent  des  parametres  d' impulsion  coniine  leur  DOA,  RF,  TOA, 

PW,  PRI  et  la  polarisation.  Ceux-ci  sont  efficaces  pour  des  emetteurs  simples  ayant  des 
frequences  et  PRI  relativement  constants  et  des  parametres  bien  scparcs.  Cependant,  ils 
ne  sont  pas  tres  efficaces  dans  des  environnements  a  signaux  denses  ou  les  emetteurs 
ont  des  parametres  semblables.  La  presence  d’emetteurs  similaires  et  d'imprecision 
dans  les  mesures  causent  des  ambiguites  au  desentrelacement.  De  plus,  etant  donne  que 
les  radars  modernes  adoptent  des  techniques  de  plus  en  plus  complexes  et  des 
modulations  d'ondes  sophistiquees,  les  methodes  conventionelles  deviennent  encore 
plus  vulnerables. 

Le  developpement  de  techniques  de  desentrelacement  faisant  usage  de  P  information 
intra-impulsion  suscite  un  grand  interet  depuis  peu.  L’information  intra-impulsion  est 
une  source  riche  d’ information  des  caracteristiques  des  emetteurs,  des  chemins  de 
propagation,  ainsi  que  de  P  environnement.  Elle  decrit  les  impulsions  avec  beaucoup 
plus  de  precision  que  les  methodes  conventionnelles.  Dans  ce  rapport,  nous  examinons 
le  probleme  de  P estimation  du  nombre  d’emetteurs  a  partir  des  trains  d'impulsions  et 
de  P  information  intra-impulsion.  II  s’agit  la  d’un  pre-requis  pour  le  desentrelacement 
dimpulsions. 

Resultats:  Dans  le  passe,  P auteur  a  developpe  un  moyen  d'estimer  le  nombre 
d'emetteurs  en  se  basant  sur  le  critere  de  la  longueur  de  description  minimale  (MDL). 
Cependant,  cette  approche  est  limitee  lorsqu’elle  est  doit  traiter  un  grand  nombre 
d'impulsions.  Le  nombre  de  calculs  augmente  beaucoup  lorsque  la  quantite  de  signaux 
est  elevee.  Dans  ce  rapport,  nous  avons  etendu  P  approche  afin  de  traiter  des  impulsions 
nombreuses.  Les  points  principaux  sont  les  suivants. 

•  Le  modele  de  signal  est  re-formule  pour  accepter  un  grand  nombre  d'impulsions. 

•  Le  critere  base  sur  la  theorie  d’ information  generale  (GIC)  est  applique  pour 
estimer  le  nombre  d'emetteurs.  L’ analyse  de  la  performance  de  cette  approche  fut 
analysee  et  sa  consistance  statistique  fut  demontree. 

•  Des  simulations  sont  utilisees  pour  demontrer  l’efficacite  de  l’approche. 

Interpretation:  L’approche  a  surmonte  les  difficultes  rencontrees  par  la  methode 
precedente.  Elle  peut  traiter  un  grand  nombre  d'impulsions,  ce  qui  la  rend  attrayante 
pour  des  applications  en  temps  reel.  Nous  demontrons  que  les  GICs  sont  des  versions 
generalisees  des  criteres  bases  sur  la  theorie  de  linformation  avec  les  memes  avantages 
de  ceux-ci,  e’est-a-dire  qu’ils  n’ont  pas  besoin  de  procedure  d’ajustement  de  seuils.  En 
plus,  comme  les  GICs  n'impliquent  aucun  calcul  sophistique  pour  les  estimateurs  de 
probabilite  maximale,  ils  sont  plus  rapides.  En  conclusion,  P  approche  proposee  est  une 
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possibilite  que  nous  pourrions  utiliser  avec  succes  dans  un  systeme  ESM  autonome 
moderne. 
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1.  Introduction 


In  ESM  (electronic  support  measures)  and  ELINT  (electronic  intelligence) 
applications,  receivers  arc  used  to  intercept  radar  signals  and  perform  emitter 
classification  and  identification  for  electronic  warfare  (EW)  purposes.  In  a  typical 
signal  environment,  since  there  arc  usually  more  than  one  emitters  out  there 
transmitting  pulsed  signals,  a  wide-open  EW  receiver  with  full  angular  coverage  will 
receive  a  sequence  of  signal  pulses  interleaved.  In  order  to  identify  the  emitters,  the 
pulse  trains  need  to  be  deinterleaved,  i.e.,  to  separate  the  pulse  trains  into  individual 
emitter  groups,  where  pulses  in  one  group  belong  to  same  emitter  [1][2].  Conventional 
approaches  for  deinterleaving  are  based  on  the  use  of  pulse  parameters  such  as 
direction-of-arrival  (DOA),  radio  frequency  (RF),  time-of-arrival  (TOA),  pulse  width 
(PW),  pulse  repetition  interval  (PRI)  and  polarization  [1],  They  arc  efficient  for  simple 
emitters,  i.e.,  emitters  with  relatively  constant  frequency  and  PRI  from  pulse  to  pulse, 
and  well  separated  parameters.  However,  they  encounter  difficulties  in  dense  signal 
environments  where  emitters  have  similar  parameters.  The  presence  of  similar  emitters 
plus  inaccuracy  in  signal  parameter  measurements  by  the  receivers  will  cause 
ambiguities  in  deinterleaving.  In  addition,  as  modern  radars  adopt  more  and  more 
complex  and  sophisticated  digital  techniques,  they  use  complex  waveform  modulations, 
making  the  conventional  techniques  even  more  vulnerable. 

The  drawbacks  of  the  conventional  deinterleaving  approaches  arc  due  to  the  fact  that 
the  conventional  pulse  parameters  arc  not  sufficient  for  uniquely  identifying  a  pulse.  To 
counter  these  difficulties,  there  has  been  increasing  interest  in  developing 
deinterleaving  techniques  based  on  the  use  of  intrapulse  information  of  the  pulses 
[2]  [3]  [4].  Intrapulse  refers  to  the  shape  of  the  pulse  which  includes  the  pulse  envelope 
and  phase  variation  within  a  pulse.  It  carries  rich  information  about  the  emitter 
characteristics,  signal  propagation  path  and  the  environment  [1],  The  intrapulse 
information  describes  the  pulse  more  completely  and  more  uniquely  than  conventional 
pulse  parameters,  and  is  considered  to  be  stable  in  the  sense  that,  for  a  short  period  of 
time  (at  the  order  of  milliseconds),  the  shape  of  the  pulses  signals  from  the  same 
emitter  will  not  change  significantly.  The  intrapulse  approaches  are  sometimes  called 
finger  printing  by  EW  engineers  [1],  They  arc  closely  related  to  the  unsupervised 
learning  in  pattern  recognition  such  as  cluster  analysis  [3],  fuzzy  clustering  algorithms 
[4]  and  neural  networks  [5]  [6].  Note  that  intrapulse  may  also  refer  to  features  extracted 
from  the  pulse  waveforms.  Granger  et  al.  [7]  studied  the  application  of  four  different 
self-organizing  neural  networks  for  automatic  deinterleaving  of  radar  pulses.  In  [8], 
Kamgar-Parsi  at  al.  proposed  a  neural  network  approach,  which  they  referred  to  as  the 
Hopficld-Kamgar  clustering  technique,  for  sorting  the  pulse  feature  vectors  using  the 
Hopfield  networks.  These  unsupervised  algorithms  arc  all  adaptive  and  can 
autonomously  analyze  the  pulse  data  and  categorize  them.  However,  the  performance 
of  the  algorithms  is  dependent  on  the  correct  estimation  of  the  number  of  emitters  out 
there.  The  self-organizing  algorithms  tend  to  produce  artifacts  and  possible  false  alarms 
when  corrupted  pulses  arc  encountered.  The  Hopfield-Kamgar  algorithm  handles  the 
problem  by  first  partitioning  the  pulses  into  an  initial  number  of  groups,  and  then 
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merging  the  groups  by  examining  their  pairwise  normalized  standard  deviation.  The 
approach  requires  that  a  threshold  be  set  according  to  some  subjective  judgment,  thus 
increasing  the  demand  for  interactions  between  command  users  and  the  system.  In  [9], 
Wong  et  al.  formulated  the  pulse  deinterleaving  as  a  multivariate  clustering  problem. 
They  used  the  Minimum  Description  Length  (MDL)  for  cluster  validation.  The 
approach  is  computationally  exhaustive  since  for  each  assumed  number  of  cluster, 
complex  clustering  procedures  have  to  be  performed.  Recently,  Zhou  and  Lee  [10]  have 
developed  a  novel  approach  for  estimating  the  number  of  emitters  from  a  set  of  received 
pulse  trains.  In  the  approach,  it  was  discovered  that,  by  formulating  the  pulse  data  in 
certain  ways,  the  number  of  emitters  can  be  determined  from  the  number  of  significant 
eigenvalues  of  the  covariance  matrix  of  the  pulse  data.  The  MDL  criterion  was  derived 
for  determining  the  optimal  number  of  emitters.  The  approach  is  also  capable  of 
deinterleaving  the  pulses  at  the  same  time.  The  approach  was  statistically  consistent 
and  has  the  advantage  of  not  requiring  any  subjective  threshold  setting.  When  the 
number  of  received  pulses  is  modest,  the  approach  is  computationally  efficient  since  it 
does  not  involve  iterative  procedures.  However,  in  an  EW  signal  environment,  the  data 
rate  can  be  very  high,  and  the  number  of  intercepted  radar  pulses  can  reach  up  to  106 
per  second.  For  the  approach  by  Zhou  and  Lee,  this  means  an  eigen-decomposition  of  a 
huge  covariance  matrix,  making  it  intractable  and  even  impractical. 

In  this  report,  we  extend  the  approach  by  Zhou  and  Lee  to  handling  large  number  of 
pulses.  The  contribution  of  this  report  is  two-fold.  First,  the  signal  model  is 
reformulated  for  accepting  large  numbers  of  pulses.  Secondly,  the  general  information 
theoretic  criteria  are  applied  to  estimating  the  number  of  emitters.  In  [10],  the  size  of 
the  covariance  matrix  is  determined  by  the  number  of  pulses  received.  When  the 
number  of  pulses  is  large,  the  eigen-decomposition  of  the  covariance  matrix  becomes 
impractical.  In  this  report,  the  pulse  data  is  arranged  in  such  a  way  that  the  size  of  the 
covariance  matrix  is  dependent  on  the  number  of  samples  within  pulses,  and  the 
number  of  pulses  has  no  impact  on  the  dimension  of  it.  It  is  shown  that  the  the  number 
of  significant  eigenvalues  of  the  covariance  matrix  is  equal  to  that  of  the  emitters 
present.  The  general  information  theoretic  criterion  is  proposed  for  estimating  the 
number  of  significant  eigenvalues.  The  GIC  is  a  generalized  version  of  the  regular 
information  theoretic  criteria.  The  regular  criteria  arc  usually  obtained  through  the 
maximization  of  the  likelihood  function  while  the  GICs  arc  constructed  based  on  the 
r-regulated  functions.  The  GICs  do  not  involve  any  computationally  sophisticated 
maximum  likelihood  estimators.  We  show  that  the  criterion  in  [10]  is  a  special  case  of 
the  GIC,  where  the  1-regulated  function  is  used.  Under  certain  conditions,  they  arc 
strongly  statistically  consistent,  i.e.,  the  estimated  number  of  emitters  converges  to  the 
actual  value  as  the  number  of  pulses  approaches  infinity. 

The  report  is  organized  as  follows.  In  Section  2,  the  pulse  signal  model  is  formulated 
and  discussed.  In  Section  3,  the  general  information  theoretic  criteria  arc  applied  to 
estimating  the  number  of  emitters.  The  use  of  different  r- regular  function  is  discussed, 
and  the  strong  statistical  consistency  of  the  criteria  is  shown.  In  Section  4,  computer 
simulations  arc  used  to  demonstrate  the  effectiveness  of  the  approach. 
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2.  PULSE  SIGNAL  MODEL 


Assume  that  there  arc  K  emitters  which  transmit  narrowband  pulse  signals.  The 
receiver  is  assumed  to  be  capable  of  separating  simultaneous  pulses.  Assuming  that  the 
mth  pulse  is  from  the  Mi  emitter,  the  analytic  representation  of  the  mth  pulse  signal  at 
baseband  can  be  written  as 

(1)  xm{nT)  =  Smsk(nT  Tm )  +  wm(nT), 

where  T  denotes  the  sampling  interval,  sk(nT)  denotes  the  complex  pulse  waveforms 
from  the  mth  emitter  at  the  receiver,  <)m  represents  the  amplitude  and  phase-shift  on  the 
pulse,  and  rm  is  the  time  delays  relative  to  a  reference  time  origin.  The  parameter  5m  is 
complex.  Its  amplitude  represents  the  gain  fluctuation  of  the  receiver  and  the  received 
signal  power  which  may  vary  from  pulse  to  pulse  depending  on  the  antenna  pattern  and 
scanning  mode  of  the  emitter.  The  signal-to-noise  ratio  (SNR)  of  the  received  signal 
may  vary  from  45dB  (mainbeam  illumination)  to  15dB  (sidelobe  illumination)  for  a 
circular  scanning  emitter.  The  phase  of  5m  reflects  the  time  delay  on  the  signal  caused 
by  the  receiver.  It  is  known  that  the  sampling  of  a  digital  receiver  is  usually  controlled 
by  trigger  signals.  When  an  incoming  signal  is  corrupted  with  noise  or  its  power  varies, 
the  trigger  signal  and  its  pre/post  adjustment  may  cause  different  time  delays  on  the 
pulses.  Since  sk(t)  is  the  baseband  representation  of  a  narrowband  signal,  when  the 
sampling  rate  is  sufficiently  high,  the  time  delay  rm  can  be  ignored.  It  follows  that  the 
signal  model  (1)  can  be  written  as 


(2)  xm{n)  =  Smsk{n)  +  wm{n). 

The  input  noise  wm  ( n )  includes  the  medium  ambient  noise,  antenna  thermal  noise,  and 
the  circuitry  noise,  etc..  In  practice,  it  is  usually  dominated  by  the  circuitry  noise  since 
the  RF  pre-amplifier  of  the  receiver  is  always  designed  to  have  sufficient  gain  to 
overcome  the  additional  noise  sources  in  the  receiver.  For  digital  receivers,  the 
quantization  noise  introduced  by  the  analog-to-digital  converter  is  also  a  source  of 
noise.  We  make  the  following  assumptions  about  sk(t)  and  wrn{t). 

•  The  signal  waveforms  sk(n)  are  unknown  deterministic.  Let 

(3)  sk  =  [sk{l),sk{2),...-,sk{N)]% 

where  N  denotes  the  number  of  samples.  The  pulse  vectors  {s^;  k  —  1.2,....  K} 
are  linearly  independent. 

•  The  noise  process  wm(t)  is  additive  and  independent  identically  distributed  (, i.i.d .) 
Gaussian  process  with  zero  mean  and  an  unknown  variance  of  . 
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3.  Emitter  Number  Estimation 


Assume  that  data  collected  for  each  pulses  have  the  same  length.  The  mth  pulse  data 
model  (2)  can  be  written  in  a  vector  form  as 


(4) 


X_m  ~  ^fn—k  4“  “E-mi 


where  xm  =  [xm{l ), . . .  ,xm{N)]T  and  wm  =  [wm(l wm(iV)]T.  Denote 
X  =  \x1:x2, . . . ,  xM\.  We  have 


(5)  X  =  SA  +  W, 

where  S  =  [sx ,  s2, . . . ,  sK] ,  W  =  ,  w2, . . . ,  wM]  and  A  is  of  K  x  M  given  by 


(6) 


Si  0  ...  0  0 

0  5-2  ...  0  0 

0  ...  0  1  5n 


The  matrix  A  is  called  the  association  matrix.  Each  column  of  A  has  one  non-zero 
entry  which  associates  the  received  pulse  data  with  an  underlying  emitter.  For  example, 
a  non-zero  entry  at  the  kth  row  and  the  mth  column  position  would  mean  that  the  mth 
pulse  is  from  the  the  fcth  emitter.  Since  the  rows  of  A  arc  linearly  independent,  A  can 
be  verified  to  have  a  rank  of  K. 


3.1  Eigenvalue  distribution  of  Rx 

The  covariance  of  X  is  given  by 


(7)  Rx  =  E{{SA  +  W)(SA  +  W)H}  =  S{AAH)SH  +  o*I, 

where  E  is  the  expectation  operator.  Since  {s^;  k  —  1,  2, . . . ,  K}  arc  assumed  to  be 
linearly  independent,  S  has  a  full  column  rank  of  K.  In  addition,  since  A  has  a  rank  of 
K,  matrix  S(AAH)SH  can  be  verified  to  be  positive  semi-definite  with  rank  K .  The 
eigendecomposition  of  Rx  can  be  written  as 

K 

(8)  Rx  =  y^A  jUjuf, 

i— 1 
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where  {A,,  uf^  denotes  the  eigenvalues  and  corresponding  eigenvectors.  It  can  be 
verified  that  the  eigenvalues,  when  arranged  in  a  decreasing  order,  arc  given  by 


(9)  Ai  >  A2  . . .  >  Xk  >  Ajc+i  —  ^k+2  —  ■  ■  ■  —  Aat. 

where  the  last  ( N  —  K  +  1)  smaller  eigenvalues  arc  identical.  It  can  also  be  verified 
that  the  eigenvectors  associated  with  the  K  largest  form  the  basis  of  the  subspace 
spanned  by  the  columns  of  S. 

3.2  General  information  theoretic  criterion 

From  (9),  it  can  be  seen  that  the  number  of  emitters  can  be  determined  from  the  number 
of  large  eigenvalues  of  Rx.  In  this  section,  we  discuss  the  application  of  the  general 
information  theoretic  criteria  for  estimating  the  number  of  large  eigenvalues  of  Rx.  In 
the  context  of  array  processing,  there  have  been  recently  developed  many  different 
approaches  [12] [13].  They  arc  often  based  on  the  application  of  the  information 
theoretic  criteria  introduced  by  Akaike  [14],  Rissanen  [15]  and  Schwartz  [16]  et  ai.  In 
the  model  selection  problems,  the  information  theoretic  criteria  take  into  account  both 
the  goodness-of-fit  (likelihood)  of  a  model  and  the  number  of  parameters  used  to 
achieve  that  fit.  The  criteria  take  the  form  of  a  penalized  likelihood  function,  i.e.,  a 
negative  log  likelihood  function  plus  a  penalty  function.  The  general  information 
criteria  by  Yin  and  Krishnaiah  [11]  arc  different  from  the  regular  information  theoretic 
criteria.  They  arc  based  on  the  use  of  r-regulated  functions,  and  arc  considered  to  be  a 
general  versions  of  the  information  theoretic  criteria.  The  GIC  based  approaches  do  not 
involve  any  computationally  complex  and  sophisticated  maximum  likelihood 
estimators.  Some  of  the  main  results  in  [11]  are  re-stated  as  follows. 


r-regulated  function  Let  7Z  be  an  open  set  of  real  numbers  and  7Z2  =  71x71, 

R3  =  7f2  x  71, . . .,  be  the  Cartesian  powers  of  71.  Let  /  be  a  real  function  of  finite 
sequences  of  71.  Then,  /  is  r-regulated  if  the  following  conditions  are  satisfied. 

L  if  zi  =  z2  =  ■  ■  ■  =  zN  then  f(zi,z2i  ■  ■  -  ,zN)  =  f{zi). 

2.  if  z\,  z2,  ■  ■  ■,  zn  are  not  identical,  then  f(z\,z2,  ■  ■  ■ ,  £/v)  >  f(z-. 1). 

3.  For  each  k,  the  restrictions  of  /  on  7 Zk,  fr  belongs  to  CT+1(7lk),  and  the  partial 
derivatives  of  //;  up  to  order  r  arc  all  zero  on  the  set 

{(z1,z2, . . . ,  zN)  E  7&  :  Z!  =  z2  =  . . .  =  zk) 

Theorem:  Let  /  be  a  r-regulated  function  of  finite  sequences  of  TZ.  Let 

{Aj,  i  =  1,2,...,  N}  be  a  sequence  of  random  variables  for  i  =  1,  2, . . .  ,  N,  and  Aj 

takes  values  in  71.  If  the  following  conditions  exist. 
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1. 


A i  —  A i  |=  0 (ajtf)  as  N  — >  oo  for  i  —  1,  2, . . . ,  N,  where 

(10)  Ai  >  A2  >  ■  ■  ■  >  A k  =  A/r+i  =  ■  ■  ■  A N 

arc  non-random  constants  in  7 Z,  olm  >  0  arc  non-random,  and  om  — >  0. 
2.  Cm  are  non-random.  Cm  ~ >  0,  and  a^1  /Cm  ->  0. 

The  general  information  theoretic  criterion  takes  the  forms  of 

(11)  Q(k,  Cm)  —  /(Ajfc+i,  ■  ■  ■ ,  Aw)  +  A:Cm, 

and  the  number  if  is  estimated  as  the  value  of  k  that  minimizes  Q(k,  Cm)- 

(12)  if  =  argmin<3(A;,  Cm)- 

k 

Then,  if  converges  to  if  asymptotically,  i.e., 


(13)  if  -4  if,  M  -»■  00. 

and 


(14) 


Prob{K  ^K)  <N  ■  Prob 


(cM  <  £  I  A<M>  -  A«“)  I  j 


The  following  arc  some  examples  of  the  r-regulated  functions  (the  superscript  denotes 
the  degree  of  regularity)  [11], 


(15) 

/(1)0ft,^2,- 

■■,zN)  = 

-  Y log  Zk + N  log 

(16) 

fW{z1,Z2,. 

■■,ZN)  = 

^1- 

1 

< ,  v 

ip 

N  ^ 

\  i= 1 

(17) 

/(3)  (^1,  ^2,  - 

..,ZN)  = 

ii(- 

1  N 
-Y 

N  ^ 

3= 1 

(18) 

/(OO)0^2,. 

■■,Zn)  = 

(  N 

j  N  exp 

-  ^Zi 

NEj=iZj, 
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3.3  Emitter  number  estimation 


In  practice,  the  covariance  Rx  is  usually  not  available  and  can  only  be  estimated  by  its 
sample  covariance 


(19) 


Rx  =  —XX 
M 


H 


1 

M 


M 

i— 1 


Lemma:([  12],  Appendix)  The  sample  covariance  Rx  converges  to  Rx  with  probability 
one,  i.e., 

(20)  lim  Rx  =  Rx, 

M— ►  oc 


with  a  convergence  rate  proportional  to  ajy  given  by 


(21)  aM  =  y/  (log  log  M)/M . 

It  can  be  verified  that  liniM-roo  aM  =  0. 


Let  {Aj,  i  =  1,2,...,  N }  denote  the  eigenvalues  of  Rx.  Due  to  noise  and  finite 
number  of  samples  in  estimating  the  covariance,  {  A,; }  will  be  the  perturbed  version  of 
{Aj}  and  the  smaller  eigenvalues  will  no  longer  be  identical.  They  tend  to  separate  into 
two  categories  with  K  large  eigenvalues  and  the  rest  being  smaller  ones.  Let  rt]  and  f  tJ 
denotes  the  ijth  components  of  Rx  and  Rx,  respectively.  According  to  the  theorem  by 
Bai,  Miao  and  Rao  [13],  since  Rx  converges  to  Rx  at  a  rate  proportional  to  a m ,  i.e., 


(22)  |  rtJ  -  |=  aM  i,j  =  1,2,...,  N, 

there  exists  a  constant  B  independent  of  such  that 


(23)  |  A*  -  A*  |=  BaM , 

which  implies  that  the  eigenvalues  of  Rx  each  converge  to  those  of  Rx  with  a  rate 
dominated  by 

To  apply  the  general  information  theoretic  criterion,  a  non-random  Cm  is  needed  which 
satisfies  the  second  condition  in  the  Theorem.  For  criteria  (16),  (17)  and  (18),  we 
choose 


DRDC  Ottawa  TR  2002-156 


7 


(24) 


Cm  =  V  ■  (log  M/M)^r+1^2j 


where  rj  is  a  constant  and  r  is  the  degree  of  the  regularity.  It  can  be  verified  that,  when  rj 
is  a  constant. 


(25) 


and 


lim  r)  ■  (log  M/M)(r+1)/2  =  0, 

M— TOO 


(26) 


lim 


a 


7  + 1 

M 


m-too  ry(log  M/M)(r+1)/2 


=  0. 


For  criterion  (18)  which  has  an  infinite  regularity  degree,  we  simply  use 

Cm  =  V  ■  log  M/M.  It  can  be  verified  that  such  a  Cm  satisfies  the  conditions  in  the 

Theorem. 

Then,  from  the  Theorem ,  the  general  information  theoretic  criterion  can  be  constructed 
as 


(27)  Q(k)  —  f(Xk+i,  A/c+2,  . . . ,  A N)  +  k  ■  Cm, 


where  /  denotes  a  r-regulated  function.  Cm  is  given  by  (24)  and  k  =  0, 1. ,  N.  The 
optimal  number  of  emitters,  K,  is  given  by  the  one  that  minimizes  the  GIC  (27)  over  all 
possible  k.  The  strong  consistency  can  be  established  by  the  Theorem. 

Asymptotically,  the  selection  of  the  constant  r/  in  Cm  will  not  affect  the  strong 
consistency  of  the  GIC.  However,  for  finite  numbers  of  pulses,  the  performance  of  the 
estimator  is  dependent  on  r).  In  [18],  in  the  context  of  array  processing,  Wu  and 
Fuhrmann  did  some  studies  on  the  performance  of  the  estimator.  They  provided  the 
empirical  distributions  of  the  r-regulated  functions  and  a  strategies  for  selecting  rj.  It  is 
also  noted  that  GIC  based  on  the  1 -regular  function  (16)  has  the  same  form  as  the  MDL 
criterion  derived  in  [10],  which  involves  the  ratio  of  the  arithmetic  to  the  geometric 
mean  of  the  k  smallest  eigenvalues  of  the  sample  covariance  of  the  pulse  data  vector. 
Finally,  it  should  be  mentioned  that,  asymptotically,  the  degree  of  regularity  of  the 
r-regular  functions,  in  general,  impacts  the  speed  of  convergence  for  the  consistency  of 
the  GICs.  Usually,  the  higher  the  degree  of  regularity,  the  faster  the  convergence.  This 
is  only  true  asymptotically.  For  a  finite  N,  the  convergence  of  the  GICs  may  not  follow 
this  rule,  as  will  be  shown  in  the  simulation  studies. 
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4.  Numerical  Simulation  Studies 


Computer  simulated  data  is  used  to  demonstrate  the  effectiveness  of  the  proposed 
algorithm.  Four  emitter  arc  simulated.  The  signal  waveforms  arc  selected  from  the  real 
intercepted  radar  signals.  They  have  been  converted  into  the  analytic  forms  using 
in-phase  and  quadrature  (I/Q)  components.  The  length  of  the  pulses  arc  103,  99,  103 
and  104,  respectively.  We  use  the  maximum  N  =  104  as  the  pulse  data  vector  length. 
The  shorter  pulses  arc  expanded  to  this  length  by  appending  zeros  at  the  end.  Each 
pulse  is  simulated  to  have  a  twenty  percent  perturbation  relative  in  magnitude.  A  phase 
shift  uniformly  distributed  in  [0, 27r]  is  also  simulated.  The  pulses  arc  uniformly 
selected  among  the  four  emitters.  The  additive  noise  is  simulated  as  Li. cl.  Gaussian  with 
zero  mean  and  different  variances.  The  signal-to-noise  ratio  (SNR)  is  defined  as  the 
ratio  of  the  average  signal  power  to  the  noise  variance 


(28) 


SNR  =  101og10 


P 

2al' 


where 


(29) 


Ps 


1 

2  KN 


K  N 


EE 

k= 1 t=l 


Sk{t)  |2 


denotes  the  averaged  pulse  power. 

In  forming  the  GIC  for  determining  the  number  of  large  eigenvalues,  instead  of  using 
the  total  N  eigenvalues,  we  used  the  first  L  eigenvalues.  In  the  following,  we  discuss  its 
geometric  interpretations.  The  spectral  representation  of  :i^n  in  terms  of  the 
eigenvectors  of  Rx  can  be  written  as 


N 

(30)  xm  =  y^cmtut: 

i— 1 

where  crm  =<  ut ,  xm  >  is  the  product  inner  product  of  xm  and  ut.  It  is  known  that  Rx 
has  K  large  eigenvalues  and  the  rest  are  smaller  eigenvalues.  We  take  advantage  of  this 
a  priori  knowledge  by  retaining  only  the  first  L  eigenvectors  in  the  spectrum 
representation 


L 

(31)  Xm  =  ^  ^  CynilLi  • 

i= 1 
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It  follows  that  x,m  is  an  approximate  of  xm  with  a  reduced  dimension.  Let 
cm  =  [cTOi ,  crn2  ■ . . . ,  c,„/J7'  be  a  truncation  of  the  Karhunen-Loeve  transform  of  .x,m , 
which  is  the  most  efficient  transform  in  the  mean-square-error  sense.  The  truncated 
Karhunen-Loeve  transform  cm  can  be  considered  as  a  feature  extracted  from  xra  which 
has  a  reduced  dimensionality.  Let  Ul  =  ■  ■  ■  •  1A/J  ■  The  sample  covariance 

matrix  of  the  feature  vector  {crn:  m  =  l,2,...,M}  can  be  obtained  by 


(32) 


D  1 

Rc=m2^<x 


H 


=  ULRXUL  =  diag(\\,  A2,  -  -  - ,  A L), 


m=  1 


which  has  the  identical  eigenvalues  as  the  first  L  eigenvalues  of  Rx.  Thus,  if  we  apply 
the  GICs  to  the  feature  vectors,  it  is  identical  to  the  one  when  applied  to  the  first  L 
eigenvalues  of  Rx.  That  is,  the  use  of  the  first  L  eigenvalues  of  Rx  in  forming  the  GICs 
is  equivalent  to  applying  the  GICs  to  the  feature  vectors  of  the  pulse  data  vectors.  Note 
that  the  number  of  eigenvalues  L  should  be  at  least  greater  than  the  actual  number  of 
emitters. 

We  use  the  Monte  Carlo  approach  to  evaluate  the  performance  of  the  GIC  approach. 
First,  the  performance  of  the  GICs  via  different  number  of  pulse  for  different  SNRs  is 
examined.  Each  time,  the  number  of  eigenvalues  used  is  L  =  40.  Figures  1  to  3  show 
the  correct  detection  rate  via  the  number  of  pulses  for  SNR=  lOdB,  15dB  and  20dB, 
respectively.  In  the  figures,  notations  GIC1,  GIC2,  GIC3  and  GIC4  are  used  to  denote 
the  criteria  with  the  r -regular  functions  given  by  (16),  (17),  (18)  and  (18),  respectively. 
Each  test  is  repeated  200  times  to  obtain  the  correct  detection  rate.  In  the  examples,  r/  is 
selected  to  be  6,  0.001,  0.02  x  10  3,  and  10  4,  respectively,  for  GIC1,  GIC2,  GIC3  and 
GIC4.  These  numbers  certainly  arc  not  optimal.  They  arc  obtained  by  trying  different 
values.  It  can  be  seen  that,  for  different  SNRs,  the  detection  rate  increases  as  the 
number  of  pulses  increases.  They  gradually  approaches  the  value  1. 

Figures  4  and  5  plot  the  variation  of  correct  detection  rate  via  SNR  for  different  number 
of  pulses.  The  numbers  of  pulses  arc  M  =  100  and  200  in  Figures  4  and  5,  respectively. 
The  SNR  varies  from  4dB  to  20dB  with  a  step  of  2dB.  Each  test  is  repeated  200  times 
to  obtain  the  averaged  results.  It  is  observed  that  the  correct  detection  rates  improves  as 
the  SNRs  arc  increased,  and  they  approach  one  when  the  SNR  is  sufficiently  large. 

It  is  also  observed  from  the  plots  that  the  speed  of  convergence  does  not  necessarily 
agree  with  the  theory  which  says  that  the  higher  the  degree  of  regularity,  the  faster  the 
speed  of  convergence.  For  example,  in  Figure  1,  GIC3  with  a  degree  of  regularity  3  is 
seen  to  have  a  slower  convergence  rate  than  GIC2  which  has  a  degree  of  regularity  of  1. 
This  is  due  to  the  fact  that  the  number  of  pulses  in  use  is  finite. 


10 


DRDC  Ottawa  TR  2002-156 


5.  Conclusions 


In  this  report,  the  GIC  approach  has  been  presented  for  estimating  the  number  of 
emitters  from  a  set  of  interleaved  pulse  trains.  The  approach  is  able  can  handle  large 
number  of  pulses,  and  does  not  involve  the  usually  lengthy  and  difficult  maximum 
likelihood  estimator.  Computer  simulations  arc  used  to  demonstrate  the  effectiveness  of 
the  proposed  approach.  Simulation  results  showed  that  the  proposed  approach  was  able 
to  provide  satisfactory  performance  under  modest  SNR  and  finite  number  of  pulses. 
Note  that  further  investigations  arc  still  needed  for  the  selection  of  the  certain 
parameters,  and  will  be  presented  in  the  near  future. 
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Correct  detection  rate  (%) 


Figure  4:  Variations  of  the  detection  error  rate  versus  SNR.  The  number  of  pulses  is  M  —  100. 
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